
SEQUENCE LISTING 



<110> Kasid, Usha
      Gokhale, Prafulla
      Kumar, Deepak
      Broustas, Constantinos
      Ahmad, Imran
      Dritschilo, Anatoly
      Rahman, Aquilur

<120> GENE BRCC-2 AND DIAGNOSTIC AND THERAPEUTIC USES THEREOF

<130> 224382 

<140> 10/679,865 
<141> 2003-10-06 

<150> PCT/US02/10848 
<151> 2002-04-08 

<150> 60/281,796 
<151> 2001-04-06 

<160> 2 

<170> PatentIn version 3.2

<210> 1 

<211> 862 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (294)..(620)

<400> 1 



atgatgttgt gtttggagaa aagagagaag agaattgaat aagtacatga ccccttggaa     60

gctgttttca gttgaggatt ttggtgagta gccacttgaa atcaagggaa attccatgaa    120

tagcaggagc tgggaagaaa ggaagaggag gaaggatccc accacttttt tcttcccatc    180

taacaccgtg cttatgcaaa gcatagtcca tatgagatct gccttataaa aacttgctga    240

ataaagcgaa tggaagttaa attttggact ttcttccata atagcttcaa att atg       296
                                                           Met
                                                           1

gtg act ttg ttg cct ata gag ggc cag gaa gta cat ttc ttt gag atc      344
Val Thr Leu Leu Pro Ile Glu Gly Gln Glu Val His Phe Phe Glu Ile
            5                   10                  15

cta gaa tct gag tgt gtg ctc tac aca gga tgg ata gag cga gcc tct      392
Leu Glu Ser Glu Cys Val Leu Tyr Thr Gly Trp Ile Glu Arg Ala Ser
        20                  25                  30

ggc agt tcc att tat cca gag gca aaa gca cgc ctg cca ctg gag gcg      440
Gly Ser Ser Ile Tyr Pro Glu Ala Lys Ala Arg Leu Pro Leu Glu Ala
    35                  40                  45

ctc ttg ggt tcc aac aaa gaa cct atg ttg cct aag gaa aca gtg ctt      488
Leu Leu Gly Ser Asn Lys Glu Pro Met Leu Pro Lys Glu Thr Val Leu
50                  55                  60                  65

tct ctt aaa agg tac aat ctt ggc tcc tct gcc atg aag cgg aat gtt      536
Ser Leu Lys Arg Tyr Asn Leu Gly Ser Ser Ala Met Lys Arg Asn Val
                70                  75                  80

cct gga cat gtg ctt cag aga cct tcc tat tta acc agg ata caa gtt      584
Pro Gly His Val Leu Gln Arg Pro Ser Tyr Leu Thr Arg Ile Gln Val
            85                  90                  95

aca ttg tta tgc aat tcc tct gct gag gcc ctg taa aaggagtgag           630
Thr Leu Leu Cys Asn Ser Ser Ala Glu Ala Leu
        100                 105

ttaggacaga tgcagcaaga tattaggaca gatttcgccc attattcagg ctgcacaaca    690

cagaaggaaa cttgatttca ataagaccca attcttaaca gtcttttcta cccactttta    750

cccataactt ttccaaattt ggttcaaatt gtgcagagaa acaataaaat ttttaaaaag    810

gataaactgg ctagttaaaa gtaaatggca tttaattaaa acaaatcttg ca            862
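
The <222> location given for SEQ ID NO: 1 can be checked mechanically: positions 294 through 620 span 327 nucleotides, that is 109 codons, or 108 residues plus a stop codon, which agrees with the <211> length of 108 given for SEQ ID NO: 2. The following is a minimal sketch of such a check and is not part of the listing as filed. The function name `translate_cds` and the short demo string are hypothetical; the codon table is the standard genetic code.

```python
from itertools import product

# Standard genetic code, first base varying slowest, bases in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_cds(seq, start, end):
    """Translate the 1-based, inclusive span start..end of a nucleotide string.

    Whitespace in `seq` is ignored, so rows copied from a listing can be
    passed in once the trailing position numbers are removed.
    A trailing stop codon is dropped from the result.
    """
    cds = "".join(seq.split()).lower()[start - 1:end]
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return protein.rstrip("*")


# Hypothetical demo: a short made-up fragment, not the full 862-nt sequence.
demo = "att atg gtg act ttg taa agg"
print(translate_cds(demo, 4, 18))  # MVTL
```

For the listing itself, the call would be `translate_cds(full_sequence, 294, 620)`, with the result compared residue by residue against SEQ ID NO: 2.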

<210> 2 

<211> 108 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> CDNA 
<400> 2 

Met Val Thr Leu Leu Pro Ile Glu Gly Gln Glu Val His Phe Phe Glu
1               5                   10                  15

Ile Leu Glu Ser Glu Cys Val Leu Tyr Thr Gly Trp Ile Glu Arg Ala
            20                  25                  30

Ser Gly Ser Ser Ile Tyr Pro Glu Ala Lys Ala Arg Leu Pro Leu Glu
        35                  40                  45

Ala Leu Leu Gly Ser Asn Lys Glu Pro Met Leu Pro Lys Glu Thr Val
    50                  55                  60

Leu Ser Leu Lys Arg Tyr Asn Leu Gly Ser Ser Ala Met Lys Arg Asn
65                  70                  75                  80

Val Pro Gly His Val Leu Gln Arg Pro Ser Tyr Leu Thr Arg Ile Gln
                85                  90                  95

Val Thr Leu Leu Cys Asn Ser Ser Ala Glu Ala Leu
            100                 105
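
The protein rows of SEQ ID NO: 2 follow a fixed layout: 16 three-letter residues per row, with position numbers beneath every fifth residue. A minimal sketch of producing that layout with space characters only (no tabs) is given below. The function name `format_protein` is hypothetical, and the choices to start each number in the first column of its residue and to label residue 1 are assumptions about the layout rather than a statement of the rules.

```python
def format_protein(residues, per_line=16):
    """Lay out three-letter residue codes in rows with numbering underneath.

    `residues` is a list such as ["Met", "Val", ...]. Each residue occupies
    four columns (three letters plus one space), so residue k of a row
    starts at column 4 * k. Only space characters are used for alignment.
    """
    lines = []
    for row_start in range(0, len(residues), per_line):
        row = residues[row_start:row_start + per_line]
        lines.append(" ".join(row))
        numbers = [" "] * (4 * len(row))
        for k in range(len(row)):
            pos = row_start + k + 1
            if pos == 1 or pos % 5 == 0:
                label = str(pos)
                numbers[4 * k:4 * k + len(label)] = label
        lines.append("".join(numbers).rstrip())
        lines.append("")
    return "\n".join(lines).rstrip("\n")


# First 18 residues of SEQ ID NO: 2 as an example input.
example = ("Met Val Thr Leu Leu Pro Ile Glu Gly Gln Glu Val His Phe Phe Glu "
           "Ile Leu").split()
print(format_protein(example))
```

A full row is 63 characters wide, which stays within a 72-character line limit.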




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number:
Source:

Date Processed by STIC: 10-21-03

THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, 703-308-4212. 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.1 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 

http://www.uspto.gov/web/offices/pac/checker/chkr41note.htm



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail.

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom.

Any reply including a sequence listing in electronic form should NOT be sent to the 20231 zip code address for the United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses:

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm>, EFS Submission User Manual - ePAVE)

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 22313-1450 

3. Hand Carry directly to (EFFECTIVE 12/01/2003): 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1J303, Crystal Plaza Two,
2011 South Clark Place, Arlington, VA 22202

4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office,
Box Sequence, Room 1B03-Mailroom, Crystal Plaza Two, 2011 South Clark Place, Arlington, VA 22202

Revised 10/08/2003 



Raw Sequence Listing Error Summary 



ERROR DETECTED          SUGGESTED CORRECTION          SERIAL NUMBER:

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE

1   Wrapped Nucleics / Wrapped Aminos
    The number/text at the end of each line "wrapped" down to the next line. This may occur if your file
    was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will
    prevent "wrapping."

2   Invalid Line Length
    The rules require that a line not exceed 72 characters in length. This includes white spaces.

3   Misaligned Amino Numbering
    The numbering under each 5th amino acid is misaligned. Do not use tab codes between numbers;
    use space characters, instead.

4   Non-ASCII
    The submitted file was not saved in ASCII (DOS) text, as required by the Sequence Rules. Please
    ensure your subsequent submission is saved in ASCII text.

5   Variable Length
    Sequence(s) contain n's or Xaa's representing more than one residue. Per Sequence Rules,
    each n or Xaa can only represent a single residue. Please present the maximum number of each
    residue having variable length and indicate in the <220>-<223> section that some may be missing.

6   PatentIn 2.0 "bug"
    A "bug" in PatentIn version 2.0 has caused the <220>-<223> section to be missing from amino acid
    sequence(s). Normally, PatentIn would automatically generate this section from the
    previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to
    the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for
    Artificial or Unknown sequences.

7   Skipped Sequences (OLD RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence:

    (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    (i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading)
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown)
    This sequence is intentionally skipped

    Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences.

8   Skipped Sequences (NEW RULES)
    Sequence(s) ____ missing. If intentional, please insert the following lines for each skipped sequence.

    <210> sequence id number
    <400> sequence id number
    000

9   Use of n's or Xaa's (NEW RULES)
    Use of n's and/or Xaa's have been detected in the Sequence Listing.
    Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present.
    In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents.

10 ✓ Invalid <213> Response
    Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or
    scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or
    is Artificial Sequence.

11  Use of <220>
    Sequence(s) ____ missing the <220> "Feature" and associated numeric identifiers and responses.
    Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or
    "Unknown." Please explain source of genetic material in <220> to <223> section.
    (See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules)

12  PatentIn 2.0 "bug"
    Please do not use "Copy to Disk" function of PatentIn version 2.0. This causes a corrupted file,
    resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence
    listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk.

13  Misuse of n/Xaa
    "n" can only represent a single nucleotide; "Xaa" can only represent a single amino acid.



AMC - Biotechnology Systems Branch - 09/09/2003 
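
Several of the conditions in the error summary above (the 72-character line limit, tab codes, and non-ASCII text) are easy to test for before a file is submitted. The following is a minimal sketch under stated assumptions and is not the USPTO Checker program: the function name `check_listing_lines` and the wording of the messages are hypothetical, and only these three conditions are examined.

```python
def check_listing_lines(text, max_len=72):
    """Return (line_number, problem) pairs for a sequence listing text.

    Flags lines longer than `max_len` characters (white space included),
    lines containing tab characters, and lines containing characters
    outside the 7-bit ASCII range.
    """
    problems = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        if len(line) > max_len:
            problems.append((lineno, "line longer than %d characters" % max_len))
        if "\t" in line:
            problems.append((lineno, "tab character"))
        if any(ord(ch) > 127 for ch in line):
            problems.append((lineno, "non-ASCII character"))
    return problems


# Hypothetical sample with one instance of each problem.
sample = "<210> 1\n" + "a" * 73 + "\n1\t5\nRAHMAN, Aquilur \u25a0\n"
for lineno, problem in check_listing_lines(sample):
    print(lineno, problem)
```

Reporting line numbers rather than stopping at the first problem mirrors the way the raw sequence listing flags individual lines.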






RAW SEQUENCE LISTING DATE: 10/21/2003 

PATENT APPLICATION: US/10/679,865 TIME: 11:59:59 



Input Set : A:\224382.ST25.txt
Output Set: N:\CRF4\10212003\J679865.raw

       3 <110> APPLICANT: KASID, Usha
       4       GOKHALE, Prafulla
       5       KUMAR, Depak
       6       BROUSTAS, Constantinos
       7       AHMAD, Imran
       8       DRITSCHILO, Anatoly
       9       RAHMAN, Aquilur
      11 <120> TITLE OF INVENTION: GENE BRCC-2 AND DIAGNOSTIC AND THERAPEUTIC USES THEREOF
      13 <130> FILE REFERENCE: 224382
C-->  15 <140> CURRENT APPLICATION NUMBER: US/10/679,865
C-->  15 <141> CURRENT FILING DATE: 2003-10-03
      15 <150> PRIOR APPLICATION NUMBER: PCT/US02/10848
      16 <151> PRIOR FILING DATE: 2002-04-08
      18 <150> PRIOR APPLICATION NUMBER: US 60/281,796
      19 <151> PRIOR FILING DATE: 2001-04-06
      21 <160> NUMBER OF SEQ ID NOS: 2
      23 <170> SOFTWARE: PatentIn version 3.2
      25 <210> SEQ ID NO: 1
      26 <211> LENGTH: 862
      27 <212> TYPE: DNA
      28 <213> ORGANISM: (cDNA)
      31 <220> FEATURE:
      32 <221> NAME/KEY: CDS
      33 <222> LOCATION: (294)..(620)
      35 <400> SEQUENCE: 1
      36 atgatgttgt gtttggagaa aagagagaag agaattgaat aagtacatga ccccttggaa     60
      38 gctgttttca gttgaggatt ttggtgagta gccacttgaa atcaagggaa attccatgaa    120
      40 tagcaggagc tgggaagaaa ggaagaggag gaaggatccc accacttttt tcttcccatc    180
      42 taacaccgtg cttatgcaaa gcatagtcca tatgagatct gccttataaa aacttgctga    240
      44 ataaagcgaa tggaagttaa attttggact ttcttccata atagcttcaa att atg       296
      45                                                            Met
      46                                                            1

Does Not Comply



      48 gtg act ttg ttg cct ata gag ggc cag gaa gta cat ttc ttt gag atc      344
      49 Val Thr Leu Leu Pro Ile Glu Gly Gln Glu Val His Phe Phe Glu Ile
      50             5                   10                  15
      52 cta gaa tct gag tgt gtg ctc tac aca gga tgg ata gag cga gcc tct      392
      53 Leu Glu Ser Glu Cys Val Leu Tyr Thr Gly Trp Ile Glu Arg Ala Ser
      54         20                  25                  30
      56 ggc agt tcc att tat cca gag gca aaa gca cgc ctg cca ctg gag gcg      440
      57 Gly Ser Ser Ile Tyr Pro Glu Ala Lys Ala Arg Leu Pro Leu Glu Ala
      58     35                  40                  45
      60 ctc ttg ggt tcc aac aaa gaa cct atg ttg cct aag gaa aca gtg ctt      488
      61 Leu Leu Gly Ser Asn Lys Glu Pro Met Leu Pro Lys Glu Thr Val Leu



file://C:\CRF4\Outhold\VsrJ679865.htm 



10/21/03 




      62 50                  55                  60                  65
      66 tct ctt aaa agg tac aat ctt ggc tcc tct gcc atg aag cgg aat gtt      536
      67 Ser Leu Lys Arg Tyr Asn Leu Gly Ser Ser Ala Met Lys Arg Asn Val
      68                 70                  75                  80
      70 cct gga cat gtg ctt cag aga cct tcc tat tta acc agg ata caa gtt      584
      71 Pro Gly His Val Leu Gln Arg Pro Ser Tyr Leu Thr Arg Ile Gln Val
      72             85                  90                  95
      74 aca ttg tta tgc aat tcc tct gct gag gcc ctg taa aaggagtgag           630
      75 Thr Leu Leu Cys Asn Ser Ser Ala Glu Ala Leu
      76         100                 105
      78 ttaggacaga tgcagcaaga tattaggaca gatttcgccc attattcagg ctgcacaaca    690
      80 cagaaggaaa cttgatttca ataagaccca attcttaaca gtcttttcta cccactttta    750
      82 cccataactt ttccaaattt ggttcaaatt gtgcagagaa acaataaaat ttttaaaaag    810
      84 gataaactgg ctagttaaaa gtaaatggca tttaattaaa acaaatcttg ca            862
      87 <210> SEQ ID NO: 2
      88 <211> LENGTH: 108
      89 <212> TYPE: PRT
      90 <213> ORGANISM: (cDNA)
      92 <400> SEQUENCE: 2
      94 Met Val Thr Leu Leu Pro Ile Glu Gly Gln Glu Val His Phe Phe Glu
      95 1               5                   10                  15
      98 Ile Leu Glu Ser Glu Cys Val Leu Tyr Thr Gly Trp Ile Glu Arg Ala
      99             20                  25                  30
     102 Ser Gly Ser Ser Ile Tyr Pro Glu Ala Lys Ala Arg Leu Pro Leu Glu
     103         35                  40                  45
     106 Ala Leu Leu Gly Ser Asn Lys Glu Pro Met Leu Pro Lys Glu Thr Val
     107     50                  55                  60
     110 Leu Ser Leu Lys Arg Tyr Asn Leu Gly Ser Ser Ala Met Lys Arg Asn
     111 65                  70                  75                  80
     114 Val Pro Gly His Val Leu Gln Arg Pro Ser Tyr Leu Thr Arg Ile Gln
     115                 85                  90                  95
     118 Val Thr Leu Leu Cys Asn Ser Ser Ala Glu Ala Leu
     119             100                 105
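
The <213> responses in the raw listing above are not among those the error summary lists as valid (Unknown, Artificial Sequence, or a Genus/species scientific name). A minimal sketch of that check follows. The function name `check_organism` is hypothetical, and the regular expression is only a rough stand-in for "scientific name": it accepts a capitalized genus followed by one or more lowercase words and does not consult any taxonomy.

```python
import re

# Heuristic only: capitalized genus, then one or more lowercase words.
BINOMIAL = re.compile(r"^[A-Z][a-z]+ [a-z]+(?: [a-z.]+)*$")


def check_organism(response):
    """Return (is_valid, needs_220_223) for a <213> response string.

    "Unknown" and "Artificial Sequence" are valid but make the
    <220>-<223> section mandatory; anything else must look like a
    Genus/species name to be accepted.
    """
    value = response.strip()
    if value in ("Unknown", "Artificial Sequence"):
        return True, True
    return bool(BINOMIAL.match(value)), False


for candidate in ("(cDNA)", "Homo sapiens", "Artificial Sequence"):
    print(candidate, check_organism(candidate))
```

Under this sketch, a response of "Homo sapiens" passes without requiring a <220>-<223> section, while "(cDNA)" is rejected.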



VERIFICATION SUMMARY DATE: 10/21/2003

PATENT APPLICATION: US/10/679,865 TIME: 12:00:00

Input Set : A:\224382.ST25.txt 

Output Set: N:\CRF4\10212003\J679865.raw 

L:15 M:270 C: Current Application Number differs, Replaced Current Application No 
L:15 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
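
Each verification summary entry has the same shape: an `L:` number, an `M:` number, a single letter followed by a colon, and free text. A minimal sketch of reading such entries into records is shown below. The function name `parse_summary` and the field names are hypothetical, and the reading of `L` as the raw listing line number and `M` as a message number is an assumption based on the two entries above (both refer to line 15, where the application number and filing date appear).

```python
import re

# Assumed entry shape: "L:<int> M:<int> <letter>: <text>"
ENTRY = re.compile(r"^L:(\d+)\s+M:(\d+)\s+([A-Z]):\s*(.*)$")


def parse_summary(text):
    """Parse verification summary lines into dictionaries.

    Lines that do not match the assumed entry shape are skipped.
    """
    entries = []
    for line in text.splitlines():
        match = ENTRY.match(line.strip())
        if match:
            entries.append({
                "line": int(match.group(1)),
                "message": int(match.group(2)),
                "kind": match.group(3),
                "text": match.group(4),
            })
    return entries


summary = (
    "L:15 M:270 C: Current Application Number differs, Replaced Current Application No\n"
    "L:15 M:271 C: Current Filing Date differs, Replaced Current Filing Date\n"
)
for entry in parse_summary(summary):
    print(entry["line"], entry["message"], entry["text"])
```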


